Overview

Dataset Statistics

Number of Variables 12
Number of Rows 1599
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 240
Duplicate Rows (%) 15.0%
Total Size in Memory 150.0 KB
Average Row Size in Memory 96.1 B
Variable Types
  • Numerical: 11
  • Categorical: 1

Dataset Insights

residual sugar is skewed Skewed
chlorides is skewed Skewed
sulphates is skewed Skewed
Dataset has 240 (15.01%) duplicate rows Duplicates
quality has constant length 1 Constant Length
citric acid has 132 (8.26%) zeros Zeros

Variables


fixed acidity

numerical

Approximate Distinct Count 96
Approximate Unique (%) 6.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 8.3196
Minimum 4.6
Maximum 15.9
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • fixed acidity is skewed right (γ1 = 0.9818)

Quantile Statistics

Minimum 4.6
5-th Percentile 6.1
Q1 7.1
Median 7.9
Q3 9.2
95-th Percentile 11.8
Maximum 15.9
Range 11.3
IQR 2.1

Descriptive Statistics

Mean 8.3196
Standard Deviation 1.7411
Variance 3.0314
Sum 13303.1
Skewness 0.9818
Kurtosis 1.1249
Coefficient of Variation 0.2093
  • fixed acidity is not normally distributed (p-value 3.0449813776924647e-05)
  • fixed acidity has 49 outliers

volatile acidity

numerical

Approximate Distinct Count 143
Approximate Unique (%) 8.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 0.5278
Minimum 0.12
Maximum 1.58
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • volatile acidity is skewed right (γ1 = 0.671)

Quantile Statistics

Minimum 0.12
5-th Percentile 0.27
Q1 0.39
Median 0.52
Q3 0.64
95-th Percentile 0.84
Maximum 1.58
Range 1.46
IQR 0.25

Descriptive Statistics

Mean 0.5278
Standard Deviation 0.1791
Variance 0.03206
Sum 843.985
Skewness 0.671
Kurtosis 1.218
Coefficient of Variation 0.3392
  • volatile acidity has 19 outliers

citric acid

numerical

Approximate Distinct Count 80
Approximate Unique (%) 5.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 0.271
Minimum 0
Maximum 1
Zeros 132
Zeros (%) 8.3%
Negatives 0
Negatives (%) 0.0%
  • citric acid is skewed right (γ1 = 0.318)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.09
Median 0.26
Q3 0.42
95-th Percentile 0.6
Maximum 1
Range 1
IQR 0.33

Descriptive Statistics

Mean 0.271
Standard Deviation 0.1948
Variance 0.03795
Sum 433.29
Skewness 0.318
Kurtosis -0.7903
Coefficient of Variation 0.7189
  • citric acid is not normally distributed (p-value 5.524924957027384e-07)
  • citric acid has 1 outliers

residual sugar

numerical

Approximate Distinct Count 91
Approximate Unique (%) 5.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 2.5388
Minimum 0.9
Maximum 15.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • residual sugar is skewed right (γ1 = 4.5364)

Quantile Statistics

Minimum 0.9
5-th Percentile 1.59
Q1 1.9
Median 2.2
Q3 2.6
95-th Percentile 5.1
Maximum 15.5
Range 14.6
IQR 0.7

Descriptive Statistics

Mean 2.5388
Standard Deviation 1.4099
Variance 1.9879
Sum 4059.55
Skewness 4.5364
Kurtosis 28.5244
Coefficient of Variation 0.5554
  • residual sugar is not normally distributed (p-value 8.435114675594647e-14)
  • residual sugar has 155 outliers

chlorides

numerical

Approximate Distinct Count 153
Approximate Unique (%) 9.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 0.08747
Minimum 0.012
Maximum 0.611
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • chlorides is skewed right (γ1 = 5.675)

Quantile Statistics

Minimum 0.012
5-th Percentile 0.054
Q1 0.07
Median 0.079
Q3 0.09
95-th Percentile 0.1261
Maximum 0.611
Range 0.599
IQR 0.02

Descriptive Statistics

Mean 0.08747
Standard Deviation 0.04707
Variance 0.002215
Sum 139.859
Skewness 5.675
Kurtosis 41.5817
Coefficient of Variation 0.5381
  • chlorides is not normally distributed (p-value 5.685351342400229e-16)
  • chlorides has 112 outliers

free sulfur dioxide

numerical

Approximate Distinct Count 60
Approximate Unique (%) 3.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 15.8749
Minimum 1
Maximum 72
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • free sulfur dioxide is skewed right (γ1 = 1.2494)

Quantile Statistics

Minimum 1
5-th Percentile 4
Q1 7
Median 14
Q3 21
95-th Percentile 35
Maximum 72
Range 71
IQR 14

Descriptive Statistics

Mean 15.8749
Standard Deviation 10.4602
Variance 109.4149
Sum 25384
Skewness 1.2494
Kurtosis 2.0135
Coefficient of Variation 0.6589
  • free sulfur dioxide is not normally distributed (p-value 0.00025235789277722187)
  • free sulfur dioxide has 30 outliers

total sulfur dioxide

numerical

Approximate Distinct Count 144
Approximate Unique (%) 9.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 46.4678
Minimum 6
Maximum 289
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • total sulfur dioxide is skewed right (γ1 = 1.5141)

Quantile Statistics

Minimum 6
5-th Percentile 11
Q1 22
Median 38
Q3 62
95-th Percentile 112.1
Maximum 289
Range 283
IQR 40

Descriptive Statistics

Mean 46.4678
Standard Deviation 32.8953
Variance 1082.1024
Sum 74302
Skewness 1.5141
Kurtosis 3.7942
Coefficient of Variation 0.7079
  • total sulfur dioxide is not normally distributed (p-value 6.3101627851667615e-06)
  • total sulfur dioxide has 55 outliers

density

numerical

Approximate Distinct Count 436
Approximate Unique (%) 27.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 0.9967
Minimum 0.9901
Maximum 1.0037
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • density is skewed right (γ1 = 0.0712)

Quantile Statistics

Minimum 0.9901
5-th Percentile 0.9936
Q1 0.9956
Median 0.9968
Q3 0.9978
95-th Percentile 1
Maximum 1.0037
Range 0.01362
IQR 0.002235

Descriptive Statistics

Mean 0.9967
Standard Deviation 0.001887
Variance 3.562e-06
Sum 1593.7979
Skewness 0.07122
Kurtosis 0.9274
Coefficient of Variation 0.001893
  • density has 45 outliers

pH

numerical

Approximate Distinct Count 89
Approximate Unique (%) 5.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 3.3111
Minimum 2.74
Maximum 4.01
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • pH is skewed right (γ1 = 0.1935)

Quantile Statistics

Minimum 2.74
5-th Percentile 3.06
Q1 3.21
Median 3.31
Q3 3.4
95-th Percentile 3.57
Maximum 4.01
Range 1.27
IQR 0.19

Descriptive Statistics

Mean 3.3111
Standard Deviation 0.1544
Variance 0.02384
Sum 5294.47
Skewness 0.1935
Kurtosis 0.8007
Coefficient of Variation 0.04663
  • pH is not normally distributed (p-value 0.0009526755107512645)
  • pH has 35 outliers

sulphates

numerical

Approximate Distinct Count 96
Approximate Unique (%) 6.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 0.6581
Minimum 0.33
Maximum 2
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • sulphates is skewed right (γ1 = 2.4264)

Quantile Statistics

Minimum 0.33
5-th Percentile 0.47
Q1 0.55
Median 0.62
Q3 0.73
95-th Percentile 0.93
Maximum 2
Range 1.67
IQR 0.18

Descriptive Statistics

Mean 0.6581
Standard Deviation 0.1695
Variance 0.02873
Sum 1052.38
Skewness 2.4264
Kurtosis 11.6799
Coefficient of Variation 0.2576
  • sulphates is not normally distributed (p-value 5.623752235957702e-07)
  • sulphates has 59 outliers

alcohol

numerical

Approximate Distinct Count 65
Approximate Unique (%) 4.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 25584
Mean 10.423
Minimum 8.4
Maximum 14.9
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • alcohol is skewed right (γ1 = 0.86)

Quantile Statistics

Minimum 8.4
5-th Percentile 9.2
Q1 9.5
Median 10.2
Q3 11.1
95-th Percentile 12.5
Maximum 14.9
Range 6.5
IQR 1.6

Descriptive Statistics

Mean 10.423
Standard Deviation 1.0657
Variance 1.1356
Sum 16666.35
Skewness 0.86
Kurtosis 0.1957
Coefficient of Variation 0.1022
  • alcohol is not normally distributed (p-value 0.0005879379978337821)
  • alcohol has 13 outliers

quality

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 105534

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 5
2nd row 5
3rd row 5
4th row 6
5th row 5

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1599
  • The top 2 categories (5, 6) take over 50.0%
  • quality has words of constant length

Interactions

Correlations

Missing Values